15 research outputs found
Quantifying Social Biases Using Templates is Unreliable
Recently, there has been an increase in efforts to understand how large
language models (LLMs) propagate and amplify social biases. Several works have
utilized templates for fairness evaluation, which allow researchers to quantify
social biases in the absence of test sets with protected attribute labels.
While template evaluation can be a convenient and helpful diagnostic tool to
understand model deficiencies, it often uses a simplistic and limited set of
templates. In this paper, we study whether bias measurements are sensitive to
the choice of templates used for benchmarking. Specifically, we investigate the
instability of bias measurements by manually modifying templates proposed in
previous works in a semantically-preserving manner and measuring bias across
these modifications. We find that bias values and resulting conclusions vary
considerably across template modifications on four tasks, ranging from an 81%
reduction (NLI) to a 162% increase (MLM) in (task-specific) bias measurements.
Our results indicate that quantifying fairness in LLMs, as done in current
practice, can be brittle and needs to be approached with more care and caution
The Bias Amplification Paradox in Text-to-Image Generation
Bias amplification is a phenomenon in which models increase imbalances
present in the training data. In this paper, we study bias amplification in the
text-to-image domain using Stable Diffusion by comparing gender ratios in
training vs. generated images. We find that the model appears to amplify
gender-occupation biases found in the training data (LAION). However, we
discover that amplification can largely be attributed to discrepancies between
training captions and model prompts. For example, an inherent difference is
that captions from the training data often contain explicit gender information
while the prompts we use do not, which leads to a distribution shift and
consequently impacts bias measures. Once we account for various distributional
differences between texts used for training and generation, we observe that
amplification decreases considerably. Our findings illustrate the challenges of
comparing biases in models and the data they are trained on, and highlight
confounding factors that contribute to bias amplification
Recommended from our members
The 16th Data Release of the Sloan Digital Sky Surveys: First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra
This paper documents the 16th data release (DR16) from the Sloan Digital Sky Surveys (SDSS), the fourth and penultimate from the fourth phase (SDSS-IV). This is the first release of data from the Southern Hemisphere survey of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2); new data from APOGEE-2 North are also included. DR16 is also notable as the final data release for the main cosmological program of the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), and all raw and reduced spectra from that project are released here. DR16 also includes all the data from the Time Domain Spectroscopic Survey and new data from the SPectroscopic IDentification of ERosita Survey programs, both of which were co-observed on eBOSS plates. DR16 has no new data from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey (or the MaNGA Stellar Library "MaStar"). We also preview future SDSS-V operations (due to start in 2020), and summarize plans for the final SDSS-IV data release (DR17)
The 16th Data Release of the Sloan Digital Sky Surveys: First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra
This paper documents the 16th data release (DR16) from the Sloan Digital Sky Surveys (SDSS), the fourth and penultimate from the fourth phase (SDSS-IV). This is the first release of data from the Southern Hemisphere survey of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2); new data from APOGEE-2 North are also included. DR16 is also notable as the final data release for the main cosmological program of the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), and all raw and reduced spectra from that project are released here. DR16 also includes all the data from the Time Domain Spectroscopic Survey and new data from the SPectroscopic IDentification of ERosita Survey programs, both of which were co-observed on eBOSS plates. DR16 has no new data from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey (or the MaNGA Stellar Library "MaStar"). We also preview future SDSS-V operations (due to start in 2020), and summarize plans for the final SDSS-IV data release (DR17)
The 16th Data Release of the Sloan Digital Sky Surveys : First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra
This paper documents the 16th data release (DR16) from the Sloan Digital Sky Surveys (SDSS), the fourth and penultimate from the fourth phase (SDSS-IV). This is the first release of data from the Southern Hemisphere survey of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2); new data from APOGEE-2 North are also included. DR16 is also notable as the final data release for the main cosmological program of the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), and all raw and reduced spectra from that project are released here. DR16 also includes all the data from the Time Domain Spectroscopic Survey and new data from the SPectroscopic IDentification of ERosita Survey programs, both of which were co-observed on eBOSS plates. DR16 has no new data from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey (or the MaNGA Stellar Library "MaStar"). We also preview future SDSS-V operations (due to start in 2020), and summarize plans for the final SDSS-IV data release (DR17).Peer reviewe
The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar and APOGEE-2 Data
This paper documents the seventeenth data release (DR17) from the Sloan Digital Sky Surveys; the fifth and final release from the fourth phase (SDSS-IV). DR17 contains the complete release of the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, which reached its goal of surveying over 10,000 nearby galaxies. The complete release of the MaNGA Stellar Library (MaStar) accompanies this data, providing observations of almost 30,000 stars through the MaNGA instrument during bright time. DR17 also contains the complete release of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) survey which publicly releases infra-red spectra of over 650,000 stars. The main sample from the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), as well as the sub-survey Time Domain Spectroscopic Survey (TDSS) data were fully released in DR16. New single-fiber optical spectroscopy released in DR17 is from the SPectroscipic IDentification of ERosita Survey (SPIDERS) sub-survey and the eBOSS-RM program. Along with the primary data sets, DR17 includes 25 new or updated Value Added Catalogs (VACs). This paper concludes the release of SDSS-IV survey data. SDSS continues into its fifth phase with observations already underway for the Milky Way Mapper (MWM), Local Volume Mapper (LVM) and Black Hole Mapper (BHM) surveys
A nationwide study of adults admitted to hospital with diabetic ketoacidosis or hyperosmolar hyperglycaemic state and COVIDâ19
AimsTo investigate characteristics of people hospitalized with coronavirus-disease-2019 (COVID-19) and diabetic ketoacidosis (DKA) or hyperosmolar hyperglycaemic state (HHS), and to identify risk factors for mortality and intensive care admission.Materials and methodsRetrospective cohort study with anonymized data from the Association of British Clinical Diabetologists nationwide audit of hospital admissions with COVID-19 and diabetes, from start of pandemic to November 2021. The primary outcome was inpatient mortality. DKA and HHS were adjudicated against national criteria. Age-adjusted odds ratios were calculated using logistic regression.ResultsIn total, 85 confirmed DKA cases, and 20 HHS, occurred among 4073 people (211 type 1 diabetes, 3748 type 2 diabetes, 114 unknown type) hospitalized with COVID-19. Mean (SD) age was 60 (18.2)âyears in DKA and 74 (11.8)âyears in HHS (pâ<â.001). A higher proportion of patients with HHS than with DKA were of non-White ethnicity (71.4% vs 39.0% pâ=â.038). Mortality in DKA was 36.8% (nâ=â57) and 3.8% (nâ=â26) in type 2 and type 1 diabetes respectively. Among people with type 2 diabetes and DKA, mortality was lower in insulin users compared with non-users [21.4% vs. 52.2%; age-adjusted odds ratio 0.13 (95% CI 0.03-0.60)]. Crude mortality was lower in DKA than HHS (25.9% vs. 65.0%, pâ=â.001) and in statin users versus non-users (36.4% vs. 100%; pâ=â.035) but these were not statistically significant after age adjustment.ConclusionsHospitalization with COVID-19 and adjudicated DKA is four times more common than HHS but both associate with substantial mortality. There is a strong association of previous insulin therapy with survival in type 2 diabetes-associated DKA
A nanotube/polymer composite biosensing thin-film transistor platform for C-reactive protein detection
In the present study, a back-gate thin-film transistor (TFT) based on a single-walled carbon nanotubes (SWCNTs)/poly(9,9-dioctylfluorene-co-bithiophene) (F8T2) composite semiconductor is proposed as electronic label-free immunosensor for the detection of the C-reactive protein (CRP). The nano-composite semiconducting ultra-thin film is deposited on the channel area of the TFT and is functionalized with a monoclonal anti-CRP antibody coating acting as biorecognition layer. The immunesensor was then used for the selective detection of the CRP both in phosphate buffered saline (PBS) solution and in human serum samples. The proposed immunosensor exhibits high current (10 - 20 ÎŒA) and good field-effect carriers mobility. Furthermore, the CRP could be revealed at a detection limit as low as 1 mg/L and in a wide dynamic range (from 0.4 nM to 2.2 ÎŒM)
Solvent-gated thin-film-transistors
Electrical double layer (EDL) thin film transistors (TFTs) are an interesting class of transistors that use an electrolyte as the gating medium. Recently it has been demonstrated that pure organic solvents can also be used as gating media for TFTs without the addition of exogenous electrolytes. Here we present a systematic study of the performances of TFTs based on two different semiconductors (P3HT and ZnO) gated through nine different solvents either pure or loaded with NaCl. The nature of the solvent impacts the transfer characteristics of the TFT through a change in the gating capacitance while the threshold voltage remains unaffected. Depending on the polarity of solvents, addition of NaCl gives rise to different responses. TFTs gated through highly polar solvents are unaffected by the salt concentration while for low polarity solvents the output current increases with salt up to a plateau. Furthermore, when the semiconductor surface is covered with a high capacitance thin dielectric layer, the TFT output current becomes dependent on the NaCl concentration also for high polarity solvents. This phenomenology was rationalized considering the different contributions of Helmholtz and Guy-Chapman EDLs to the capacitance and the dielectric saturation that decreases the solvent dielectric constant within the Helmholtz EDL